APPENDIX 



Original SEQ ID NOS 1-3 remain the same; 

Original SEQ ID NOS 4-6 are now SEQ ID NOS 4-5, 6-7 & 8-9 respectively 
Original SEQ ID NO: 7 is now SEQ ID NO: 10; respectively, 
Original SEQ ID NO: 8 is now SEQ ID NO: 1 1 
Original SEQ ID NO: 9 is now SEQ ID NO: 12; 
Original SEQ ID NO: 10 is now SEQ ID NO:13; 

*Please note that the part or all of the CDS regions for original SEO m NOS 8-1 A w*™ -j*k- • 

Original SEQ ID NO: 1 1 is now SEQ ED NOS 14-15- 

Original SEQ ID NO: 12 is now SEQ ID NOS 16-17;' 

Original SEQ ID NO: 13 is now SEQ ID NOS 18-19; 

Original SEQ ID NO: 14 is now SEQ ID NOS 20-21 • 

Original SEQ ID NOS 15-17 are now SEQ ID NOS 22-24- 

Original SEQ ID NOS 18-19 are now SEQ ID NOS 25-26 & 27-28 respectively 

Original SEQ ID NO: 20 is now SEQ ID NO:29; respectively, 

*See above note re: SEQ ID NOS 8-10. 

Original SEQ ID NOS 21-22 are now SEQ ID NOS 30-3 1 & 32-33 respectiverv 
OriginalSEQIDNO:23isnowSEQIDNO:34; ^respectively, 
*See above note re: SEQ ID NOS 8-10. 

Original SEQ ID NO: 24 is now SEQ ID NO:35; 
Original SEQ ID NO: 25 is now SEQ ID NOS 36-37; 
*See above note re: SEQ ID NOS 8-10. 

Original SEQ ID NO: 26 is now SEQ ID NO:38; 
*See above note re: SEQ ED NOS 8-10. 

Table 1 encompasses SEQ ID NO:39; 
Table 3 encompasses SEQ ID NO:40; 

Table 4 encompasses SEQ ID NOS 24, 35 & 41-45, respectively. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Yokoi, Haruhiko 

Shiotsu, Yukimasa 
Konishi, Noboru 

(ii) TITLE OF INVENTION: NOT^L POLYPEPTIDES 

(iii) NUMBER OF SEQUENCES :( 4 5 ) 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 North Glebe Rd. 8th floor 

(C) CITY: Arling-on 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP : 22201-4741 

(v) COMPUTER READA3LE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM ?C~ compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/765,337 

(B) FILING DATE: 23-DEC-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/ JP96/01157 

(B) FILING DATE: 26-APR-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP P . HEI . 7-102 625 

(B) FILING DATE: 26-APR-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Crawford, Arthur R . 

(B) REGISTRATION NUMBER: 25,327 

(C) REFERENCE /DOCKET NUMBER: 249-89 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-816-4000 

(B) TELEFAX: 703-816-4100 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



58 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
1 5 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Aso He Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin 
65 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu 
100 105 HO 

Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala He Phe 
115 120 125 

Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Vai Arc Arg Ala Pro/Thr Tyr ArgAla 
i45 150 155 / / 160 

Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg 
165 170 175 

Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
180 185 190 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
195 -200 205 

Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
210- 215 220 

Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
225 230 235 240 

Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr 
245 250 255 

Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
260 265 270 

Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
275 280 285 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
290 295 300 



59 



305 HiS LSU Gln SSr Phe LeU Ser Tyr Arg 



305 310 

Val Leu Arg His Leu Ala Gin Pro 
325 

!) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



315 320 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lvs Leu Leu 
5 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro Glu Val 
20 25 so 

His Pro Leu Pro Thr Pro Val Leu Leu Pr 



35 



o Ala Val Asp Phe Ser Leu 
40 45 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Asp lie Leu 



55 



60 



Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin 
65 70 75 Y go 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin 
85 go 95 

Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu 
100 105 no 

Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala He Phe 
115 120 125 

Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 i 4 o 

Val Gly Gly Ser Thr Leu Cys Val Arg/&y Gly Gly Ser Gly Gly Gly 
145 150 y f 155 leo 

Ser Gly Gly Gly Ser Arabia Pro Thr Tyr Arg Ala Ser Ser Leu Pro 
165 # 170 175 



Gin Ser Phe Leu Leu Lys Ser Leu Glu 
180 



Gin Val Arg Lys He Gin Gly 
185 190 



Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys 
195 200 or. c 



205 



60 



His Pro Glu ^ Leu v .l Leu Leu Gly His Ser Gly Ile Pro Trp 

Z15 220 

Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys 

235 240 

Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin 
245 250 255 

Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Aso Thr Leu 
260 265 270 

Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu 

280 285 

Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro 

295 300 

Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala 

310 315 320 

Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His 
325 330 335 

Leu Ala Gin Pro 
340 

INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lvs Leu Leu 
5 10 " 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro Glu Val 
20 2 5 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asd Phe Ser Leu 
35 40 45 - 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Asp lie Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin 
5 70 75 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu - 
100 105 il0 

61 



Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala He Phe 
115 120 125 

Leu S.er Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly Gly Gly Ser Gly Gly 
i45 150 155 160 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala Pro Thr Tyr Arg Ala 
1^5 170 175 

Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val Arg 
180 185 190 

Lys He' Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr 
195 200 205 

Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu 
210 215 220 

Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin 
225 230 235 240. 

Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr Gin 
245 250 255 

Gly Leu Leu Gin Ala Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr 
260 265 270 

Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp 
275 280 285 

Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr Gin 
290 295 300 

Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 
305 310 315 320 

Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg 
325 330 335 

Val Leu Arg His Leu Ala Gin Pro 
340 

INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 1..63 
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(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 64.. 1047 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1047 



CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 
Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arq Glv Lys Val 
125 130 135 



48 



96 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 
Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 
"21 "20 -is _ 1Q 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Ara Val 
" 5 1 5 io 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 14 4 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192 
Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu P^-o Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 24 0 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
4 5 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 283 
Ala Gin Asp He Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336 
Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 go 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384 
Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 4 32 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 
HO us 120 
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CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG CGG GCG 528 
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala 
140 145 150 155 

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 57 6 

Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 
160 165 170 



63 



2 S c £ 5 ft? S S « 5 5 S t. JS Gin X* « 

180 185 

AAG CTG TGT GCC ACC TAG AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 
Lys Leu Cys Ala Thr T yr Lys Leu Cys His Pro Glu gTu £u vll Leu 

195 200 

Leu Tly Ss Ser Leu Glv ^ f C V* ^ CCC CTG AGC AGC TGG CCC 
y Hxs Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 

^10 215 

rf CAG CTG GCA GGC TGC TT ^ AGC CAA CTC CAT AGC GGC 

Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu £s Jer Gly 

225 230 235 

Leu lit Leu ?yr Gin gT ^ ^ ^ ° TG GAA GGG ATC TCC CCC 

SU Tyr G l" Gly Leu Leu Gln Ala Leu Glu Gly He Ser Pro 
■ 240 245 250 

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp v5 Ala Asp HI 

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT err 
Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu G g Jet Ala Pro £« 

275 280 

Leu Gin Pro £hr Gil gT f ? C CCG GCC TTC GCC TCT GCT TTG CAG 

eu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 

290 295 

25 2J S SJ SJ S S si S 2 S S S2 E S S 

305 310 315 

Su 51? l CG I AC GTT CTA CGC CAC CTT ^CC CAG CCC 

Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin *ro 

3 20 325 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 



Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 



672 



720 



768 



816 



864 



912 



960 



1008 



1047 
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Gin Cys Pro Glu His Pro Leu Pro Thr Pro Val L^Lu Pro Ala 

35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 

50 55 
Ala Gin Asp He Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 



70 75 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 

85 90 



Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 



100 



Ser Leu 



105 



Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 

x15 120 

Pro Asn Ala He Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 

130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala 

145 150 155 



Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 

165 170 

Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu 

175 180 185 

Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 

195 200 
Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 



215 



Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 

225 230 235 

Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro 

240 245 250 

Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 



260 265 



Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
270 275 280 

Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 

290 295 

Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 

315 



305 310 



Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
320 325 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

65 



(B) TYPCTBIfucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 1 . . 63 

(ix) FEATURE: 

(A) NAME /KEY: mat_?eptide 

(B) LOCATION: 64 1083 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1083 



(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 6: 

ATG GAG CTG ACT GAA TTG CTC CTC GTG GTC ATG CTT CTC CTA ACT GCA 4 8 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 

-21 -20 -15 -10 

AGG CTA ACG CTG TCC AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC 96 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 
-5 15 10 

CTC AGT AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG AGC 14 4 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC CTG CTG CCT GCT 192 

Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 
30 35 40 

GTG GAC TTT AGC TTG GGA GAA TGG AAA ACC CAG ATG GAG GAG ACC AAG 240 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
45 50 55 

GCA CAG GAC ATT CTG GGA GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG 288 
Ala Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
60 65 70 75 

GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 336 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 384 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
95 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 4 32 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 
110 115 120 
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CCC AAT GCC ATC T^^TG AGC TTC CAA CAC CTG CTC CG^^GA AAG GTG 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 
125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTC AGG GGT GGC 
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly 
140 145 150' 155 

GGT TCT GGA GGT GGT TCC GGA GGG GGT TCT AGA GCA CCA ACA TAT CGC 
Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg 
160 165 170 

GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC TTA GAG CAA GTG 
Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 
175 180 185 

AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC 
Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 
190 195 200 

ACC. TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC TCT 
Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 
205 210 215 

CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC AGC CAG GCC CTG 
Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cvs Pro Ser Gin Ala Leu 
220 225 230 235 

CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC CTT TTC CTC TAC 
Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 
240 245 250 

CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC 
Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro 
255 260 265 

ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 
Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He 
270 275 280 

TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG CAG CCC ACC 
Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gin Pro Thr 
285 290 295 

CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG CGC CGG GCA GGA 
Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly 
300 305 310 315 

GGG GTC CTA GTT GCC TCC CAT CTG CAG AGC TTC CTG GAG GTG TCG TAC 
Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr 
320 325 330 

CGC GTT CTA CGC CAC CTT GCC CAG CCC 
Arg Val Leu Arg His Leu Ala Gin Pro 
335 340 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 amino acids 
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(B)^^Pe: amino acid 
(D) ^TOOLOGY: linear 



^PC 

(ii) MOLECULE TYPE: protein 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 

~ 15 -10 
Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val 

15 10 
Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu Hi 



15 



20 



His Ser Arg Leu Ser 
25 



Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala 



35 



40 



Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 



55 



Ala Gin Asp He Leu Gly Ala Val Thr Leu Leu I 



60 £| X " inr ^ eu Le ? Leu Glu Gly Val Met 



70 



75 



Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 

20 90 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 



100 



105 



Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 

113 120 

Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 

130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Gly Gly 

145 150 155 

Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg Ala Pro Thr Tyr Arg 

160 170 

Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser Leu Glu Gin Val 

180 185 

Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala 

195 200 

Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser 

210 215 
Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cvs Pro Ser Gin Ala Leu 

225 230 235 

Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu Phe Leu Tyr 



245 250 



Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu Leu Gly Pro 

260 



265 
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Thr Leu Asp Thr Gin Leu Asp Val Ala Asp P he A^hr Thr Ile 

Zlb 280 

Trp Gin Gin Met Glu Glu Leu Gly Met Ala Prn ai a T 

285 9Q , y ec Aia Pro Ala L «u Gin Pro Thr 

290 295 

Gin Gly Ala Met Pro Ala Pne Ala Ser Ala Phe. Gin Arg Arg Ala Gly 

310 315 
Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu Val Ser Tyr 

325 330 

Arg Val Leu Arg His Leu Ala Gin Pro 
335 340 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE; nucleic acid 

(C) S.TRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY: mat_oeDtide 

(3) LOCATION: 64.. 10 95 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..I095 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

2 2 2 £ 2 2 2 s « « j» - s « s „ 
2 2 £ 2 2 2 2 S 2 2 E 5 2 2 2 2 
2 2 2 2 2 2 2 2 2 2* 2 2 2 2 2 2 



20 



25 



30 35 

2 2 2 2 2 2 2 2 £ 2 2 2 2 £ 2 
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2 2 2 S 2 2S 2 2 £ 2 CTG CTG GSG "* GTG »™ 

6Q sp i±e Leu oly Ala Val Thr Leu Leu Leu Glu Gly Val Met 

65 70 



48 



96 



144 



CAG TGC CCA GAG GTT CAC CPT ttp rrv ^ m 

«n cys Val 2 2 2 2 £ 2 2 2 2 2 2 



240 



288 



75 
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GCA GCA CGG GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 
Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly 
80 85 90 

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG CAG AGC CTC 
Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
« 100 105 

CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG ACC ACA GCT CAC AAG GAT 
Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 
HO us 120 

CCC AAT GCC ATC TTC CTG AGC TTC CAA CAC CTG CTC CGA GGA AAG GTG 
Pro Asn Ala He Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 
125 130 135 

CGT TTC CTG ATG CTT GTA GGA GGG TCC ACC CTC TGC GTA CGG TCC GGA 
Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly 
140 145 150 155 

GGT GGC TCT GGC GGT GGT TCT GGT GGC GGC TCC GGA GGC GGT CGT GCG 
Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala 
160 165 170 

CCA ACA TAT CGC GCC TCG AGT CTA CCA CAG AGC TTC CTT TTA AAA AGC 
Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 
175 130 185 

TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG CTC CAG GAG 
Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu 
ISO 195 200 

AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG 
Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu 
2 °5 210 215 

CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 
Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro 
220 225 230 235 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC CAT AGC GGC 
Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly 
240 245 250 

CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC CTG GAA GGG ATC TCC CCC 
Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro 
255 260 265 

GAG TTG GGT CCC ACC TTG GAC ACA CTG CAG CTG GAC GTC GCC GAC TTT 
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 
27 0 275 280 

GCC ACC ACC ATC TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC 
Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 
285 290 295 

CTG CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT TTC CAG 
Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 
300 305 310 315 
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CGC CGG GCA GGA 
Arg Arg Ala Gly 



GAG GTG TCG TAC 
Glu Val Ser Tyr 
335 



^^GTC CTA GTT 

Gly Val Leu Val 

320 

CGC GTT CTA CGC 
Arg Val Leu Arg 



GCC TCC CAT CTG 
Ala Ser His Leu 
325 

CAC CTT GCC CAG 
His Leu Ala Gin 
340 



CAG AGC TTC CTG 
Gin Ser Phe Leu 
330 

CCC 
Pro 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala 

~ 21 -20 -is - 10 

Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Ara Val 

■ 1 5 16 

Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser 
15 20 25 

Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val Leu Leu *ro Ala 
30 35 40 

Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 
45 50 55 

Ala Gin Asp He Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Va^ Met 
60 65 70 75 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu L«u Gly 
80 85 90 

Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu 
95 100 105 

Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp 
HO 115 120 

Pro Asn Ala He Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val 

130 135 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Ser Gly 
140 145 150 155 

Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Arg Ala 
160 165 170 

Pro Thr Tyr Arg Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Ser 
I 75 180 i 8 5 

Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu 
I 90 195 200 
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L» LJ« cys M . ^ Tyr Lys Cys au Pro Glu G l» ^ Val Leu 

210 215 
Leu Gly His Ser Leu Gly lie Pro Tro ai* t 

220 22 | ^ ro Trp Ala Pro Leu Ser Ser Cys Pro 

230 235 
s.r Gin »l. l. u Leu A1 , Gly cys ^ ^ s ^ ^ 

L« Phe g eln Gly L „ Leu Jla ^ ^ 

260 265 
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe 

z/b 280 

Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala 

290 295 

Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin 

310 315 
Arg Ar g Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu 



325 330 



Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 4 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA " 

(ix) FEATURE: 

(A) NAME /KEY : sig^oeDtide 
(3) LOCATION: 27. .44* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTCTCCAAGC TTGAATTCCG GCCAGAATGG AGCTGACTGA ATTG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 47 base Dairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



44 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTAGAGGTAC CGCGGCCGCT TACCCTTCCT GAGACAGATT CTGGGAG 
(2) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: .SEQ ID NO: 12 : 
TGAACCTCTG GGCACTGGCT CAGT 

24 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GCTGCCTGCT GTGGACTTTA GCTT 

24 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(3) LOCATION: 1..24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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TGT TGG AAG CTC JI^AAG ATG GCA 

Cys Trp Lys Leu Arg Lys Met Ala 
1 5 



24 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Trp Lys Leu Arg Lys Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCT GAT GCT TGT AGG AGG GTC CAC 
Pro Asp Ala Cys Arg Arg Val His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Pro Asp Ala Cys Arg Arg Val His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
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(B) TYPET nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TCA AGA GTT CGT GTA TCC TGT TCA 
Ser Arg Val Arg Val Ser Cys Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ser Arg Val Arg Val Ser Cys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "synthetic DNA" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GAA TGG AAC TCG TGG ACT CTT TCC 
Glu Trp Asn Ser Trp Thr Leu Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) WBgtH: 8 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:21: 

Glu Trp Asn Ser Trp Thr Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid ' 

(A) DESCRIPTION: /desc = "synthetic DNA" 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: pep-ide 
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(xi) SEQUENCE RIPTION: SEQ ID NO: 24: 



Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: join(1..3, 43.. 66) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TGC TCTAGAACCG CCTCCGGAAC CACCTCCAGA ACCGCCACC CCT GAC GCA GAG 54 
Cys Pro Asp Ala Glu 

1 5 

GGT GGA CCC TCC 66 
Gly Gly Pro Ser 



; (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Cys Pro Asp Ala Glu Gly Gly Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA r 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

GGTTCCGGAG 'GCGGTTCTAG A GCA CCA ACA TAT CGC GCC TCG AGT 

Ala Pro Thr Tyr Arg Ala Ser Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ala Pro Thr Tyr Arg Ala Ser Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CATTCCGCGG GGTACCGCGG CCGCTCAGGG CTGGGCAAGG TGGCGTAG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



(B) LOC 



N: 22. .45 
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GGC TGC TTG AGC CM CTC CAT AGC 
Gly Cys Leu Ser Gin Leu His Ser 
1 5 



24 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Cys Leu Ser Gin Leu His Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GAC CCA ACT CGG GGG AGA TCC CTT 24 
Asp Pro Thr Arg Gly Arg Ser Leu 
1 > 5 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asp Pro Thr Arg Gly Arg Ser Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 57 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA 



(ix) FEATURE: 

(A) NAME /KEY: mutation 

(B) LOCATION: replace (25, ""} 

(ix) FEATURE: 

(A) NAME /KEY: mutation 

(B) LOCATION: replace ( 33 34 , " " ) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TAGACTCGAG GCGCGATATG TTGGCGCCCG CCGTACGCAG AGGGTGGACC CTCCTAC 57 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amine acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

Arg 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..6 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 7.. 57 



80 



(D) ©^^INFORMATION: /product- "linker^ptide" 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1. .5 

(D) OTHER INFORMATION: /product- "SplI" 

(ix) FEATURE: 

(A) NAME /KEY : mi sc_f eature 

(B) LOCATION: 7 . . 12 

(D) OTHER INFORMATION: /product- "Mrol" 

(ix) FEATURE: 

(A) -NAME/KEY: misc_f eature 

(B) LOCATION: 4 3.. 4 8 

(D) OTHER INFORMATION: /product- "Mrol" 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 58.. 61 

(D) OTHER INFORMATION: /product- "Bbel" 

(ix) FEATURE: 

(A) NAME /KEY: mutation 

(B) LOCATION: replace ( 4 . . 5 , "") 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GTA CGG TCCGGAGGTG GCTCTGGCGG TGGTTCTGGT GGCGGCTCCG GAGGCGG^CG 56 
Val Arg 
1 

TGCGC 

61 



(2) INFORMATION FOR SEQ ID NO: 37: , 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Val Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
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(ix) FEATURE: 

(A) NAME /KEY: mis cofeature 

(B) LOCATION: 1..51 

. (D) OTHER INFORMATION : /product^ "linker peptide" 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 53 

(D) OTHER INFORMATION: /product^ "SplI" 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 10.. 15 

(D) OTHER INFORMATION: /product^ "Mrol" 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION: 46.. 51 

(D) OTHER INFORMATION: /product^ "Mrol" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ACGACCGCCT CCGGAGCCGC CACCAGAACC ACCGCCAGAG CCACCTCCGG ACC 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pep-ide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Xaa Thr Pro Leu Gly Pre Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu 
1 5 10 15 

Lys Cys Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu 
20 25 30 

Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Gly Leu 
35 40 45 

Val Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser 
50 55 60 

Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His 
65 70 75 80 

Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie 
85 90 95 

Ser Pro Glu Leu Gly Pre Thr Leu Asp Thr Leu Gin Leu Asp Val Ala 
100 105 110 
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Asp Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala 
H5 120 125 

Pro Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 
130 135 140 

Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser 

145 150 155 160 

Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
165 170 175 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DeDtide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu 
1 5 10 15 

Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pre Glu Val 
20 25 30 

His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp Phe Ser Leu 
35 40 45 

Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin Aso He Leu 
50 55 60 

Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arc Gly Gin 
65 70 75 " 80 

Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin 
85 90 95 

Val Arg Leu Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu 
100 105 . HO 

Pro Pro Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala He Phe 
115 120 125 

Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 
130 135 140 

Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro Pro Thr Thr Ala 
i45 150 155 160 

Val Pro Ser Arg Thr Ser Leu Val Leu Thr Leu Asn Glu Leu Pro Asn 
165 170 175 
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Arg Thr Ser cxy Leu Leu Glu Thr Asn Phe Thr A^Ser Ala Arg Thr 
180 185 190 

Thr Gly Ser Gly Leu Leu Lys Trp Gin Gin Gly Phe Arg Ala Lys He 
195 200 205 

Pro Gly Leu Leu Asn Gin Thr Ser Arg Ser Leu Asp Gin He Pro Gly 
210 215 220 

Tyr Leu Asn Arg He His Glu Leu Leu Asn Gly Thr Arg Gly Leu Phe 
225 230 235 240 

Pro Gly Pro Ser Arg Arg Thr Leu Gly Ala Pro Asp He Ser Ser Gly 
245 250 255 

Thr Ser Asp Thr Gly Ser Leu Pro Pro Asn Leu Gin Pro Gly Tyr Ser 
260 265 270 

Pro Ser Pro Thr His Pro Pro Thr Gly Gin Tyr Thr Leu Phe Pro Leu 
275 280 285 

Pro Pro Thr Leu Pro Thr Pro Val Val Gin Leu His Pro Leu Leu Pro 
290 295 300 

Asp Pro Ser Ala Pro Thr Pro Thr Pro Thr Ser Pro Leu Leu Asn Thr 
305 310 315 320 

Ser Tyr Thr His Ser Gin Asn Leu Ser Gin Glu Gly 
325 330 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ser Gly Gly Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCLf^ESCRIPTION: SEQ ID NO:42: 

Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Ser Gly- Giv Gly 
1 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gly Gly Gly Ser Giy Gly Gly Ser 
1 5 
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